AITopics | tidy analysis

Collaborating Authors

tidy analysis

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Exploring handwritten digit classification: a tidy analysis of the MNIST dataset

@machinelearnbotFeb-1-2018, 23:22:05 GMT

In a recent post, I offered a definition of the distinction between data science and machine learning: that data science is focused on extracting insights, while machine learning is interested in making predictions. I use both machine learning and data science in my work: I might fit a model on Stack Overflow traffic data to determine which users are likely to be looking for a job (machine learning), but then construct summaries and visualizations that examine why the model works (data science). This is an important way to discover flaws in your model, and to combat algorithmic bias. This is one reason that data scientists are often responsible for developing machine learning components of a product. I'd like to further explore how data science and machine learning complement each other, by demonstrating how I would use data science to approach a problem of image classification.

artificial intelligence, digit, machine learning, (14 more...)

@machinelearnbot

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Examining the arc of 100,000 stories: a tidy analysis

@machinelearnbotMay-6-2017, 15:20:13 GMT

I recently came across a great natural language dataset from Mark Riedel: 112,000 plots of stories downloaded from English language Wikipedia. This includes books, movies, TV episodes, video games- anything that has a Plot section on a Wikipedia page. This offers a great opportunity to analyze story structure quantitatively. In this post I'll do a simple analysis, examining what words tend to occur at particular points within a story, including words that characterize the beginning, middle, or end. As I usually do for text analysis, I'll be using the tidytext package Julia Silge and I developed last year.

artificial intelligence, natural language, tidy analysis, (15 more...)

@machinelearnbot

Country: North America > United States > California > Los Angeles County > Los Angeles (0.15)

Industry: Leisure & Entertainment (0.55)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.90)

Add feedback